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DETAILED ACTION 
NOTE: This action has not been made final due merely to recent court decisions 
regarding claims not being tied to a statutory class or transforming subject matter. See 
35 U.S.C. 101 rejection below. 



Response to Arguments 

1 . Applicant's arguments filed 10/09/2008 have been fully considered but they are 
not persuasive. 

Argument 1 (page 18): 

• "However, the portion of the claim element reciting "based on at least one 
non-structural measure of relevance" is still not accounted for in Corston." 

• "As such, Corston does not teach "relevance scores for the discourse 
constituents based on at least one non-structural measure of relevance" of 
claim 1" 

• "As such, Corston fails to teach or suggest elements of "determining 
relevance scores ... percolating the relevance scores" of claim 1 for which 
it is cited." 

Response to argument 1: 

Examiner takes the position that Marcu in view of Corston in fact teaches the 
limitations of the present invention, particularly claim 1 . Examiner would like to 
point out the parallel teachings of document retrieval in both the present 
invention and cited references (present invention [0013]), wherein keywords from 
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retrieved documents are analyzed, and a score is calculated to differentiate 
information in a document. Like Corston, the present invention utilizes a 
repository of documents to perform discourse analysis on. Thus, Corston 
teaches user input query in such techniques is typically presented as either an 
explicit user generated query, or an implicit query, such as when a user requests 
documents which are similar to a set of existing documents . Tvpicai information 
retrieval systems search documents in the larger data store at either a single 
word level, or at a term level . Each of the documents is assigned a relevancy (or 
similarity) score, and the information retrieval system presents a certain subset of 
the documents searched to the user, typically that subset which has a relevancy 
score which exceeds a given threshold (Corston Col. 1 line 59 - Col. 2 line 33). 

Regarding non-structured, Corston overcomes well known limitations of previous 
inventions, wherein Corston teaches statistical search engines stems from their 
assumption that words are independent variables, i.e., words in any textual 
passage occur independently of each other. Independence in this context means 
that a conditional probability of any one word appearing in a document given the 
presence of another word therein is always zero, i.e., a document simply 
contains an unstructured collection of words or simply put a "bag of words" . As 
one can readily appreciate, this assumption, with respect to any language, is 
grossly erroneous. English, like other languages, has a rich and complex 
syntactic and lexicon-semantic structure with words whose meanings vary , often 
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widely, based on tlie specific linguistic context in which they are used , with the 
context determining in any one instance a given meaning of a word and what 
word(s) can subsequently appear. Hence, words that appear in a textual 
passage are simply not independent of each other, rather thev are highly inter- 
dependent (Corston Col. 1 line 59 - Col. 2 line 33). Thus, Corston analyzes 
unstructured text with a more sophisticated form, wherein context is used. 

Regard both structured and unstructured text, wherein unstructured text can be 
input text hand typed by a user in an email, word processor, and even audio, 
Marcu teaches the present application relates to computational linguistics and 
more particularly to techniques for parsing a text to determine its underlying 
rhetorical, or discourse, structure, and to technigues for summarizing, or 
compressing, text ([0003]). 

Further, Marcu teaches discourse parsing systems and techniques as described 
herein. The systems and techniques described here result in a discourse parsing 
system that uses a set of learned decision rules to automatically determine the 
underlying discourse structure of any unrestricted text . As a result, the discourse 
parsing system can be used, among other ways, for constructing discourse trees 
whose leaves are sentences (or units that can be identified at high levels of 
performance) . Moreover, the time, expense, and inconsistencies associated with 
manually built discourse tree derivation rules are reduced dramatically ([0019]). 
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Furthermore, Marcu teaches the formation of unstructured text into structured 
text, wherein Marcu teaches a decision-based rhetorical parsing system 
(equivalently, a discourse parsing system) automatically derives the discourse 
structure of unrestricted texts and incrementally builds corresponding discourse 
trees based on a set of learned decision rules . The discourse parsing system 
uses a shift-reduce rhetorical parsing algorithm that learns to construct rhetorical 
structures of texts from a corpus of discourse-parse action sequences . The 
rhetorical parsing algorithm implements robust lexical, syntactic and semantic 
knowledge sources ([0067]). 



Claim Rejections - 35 USC § 101 

2. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

Claims 1-10 are rejected under 35 U.S.C. 101 because: 
Claims 1-10 fail to clearly recite a statutory process to which it is tied. Claims 1-10 
recite purely mental steps and would not qualify as a statutory process. In order to 
qualify as a statutory process, the method claim should positively recite the other 
statutory class to which it is tied (i.e. apparatus, device, product, etc.). For example, the 
method steps of claims 1 , 5, 6 ,and 8 appear to recite mental steps such as 
"determining a hybrid text summary" and do not identify an apparatus that performs the 
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recited nnetliod steps, such as "a client device" as described in the specification (present 
invention [0128]). 



Claim Rejections - 35 USC § 103 

3. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the phor art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner In which the Invention was made. 



4. Claims 1, 11, 22, 23, and 36 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Marcu et al U.S. PGPUB 20020046018 (herein after Marcu) in view 
of Corston et al. US 6901399 B1 (hereinafter Corston). 

Re claims 1 , 1 1 , 22, 23, and 36, Marcu teaches a method of determining a hybrid 
text summary comprising the steps of: 

determining discourse constituents for a text ([0010]); 

determining a structural representation of discourse for the text ([0003]); 

determining relevance scores for the discourse constituents based on at least 
one non- structural measure of relevance ([0003]); 

determining a hybrid text summary ([0003]) 

However, Marcu fails to teach determining relevance scores and percolating the 



relevance scores 
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Corston teaches that it is well know to use a relevancy score, wherein a user 
input query in such techniques is typically presented as either an explicit user generated 
query, or an implicit query, such as when a user requests documents which are similar 
to a set of existing documents. Typical information retrieval systems search documents 
in the larger data store at either a single word level, or at a term level. Each of the 
documents is assigned a relevancy (or similarity) score, and the information retrieval 
system presents a certain subset of the documents searched to the user, typically that 
subset which has a relevancy score which exceeds a given threshold (Corston Col.. 1 
line 59 -Col.. 2 line 2). 

Corston teaches statistical search engines stems from their assumption that 
words are independent variables, i.e., words in any textual passage occur 
independently of each other. Independence in this context means that a conditional 
probability of any one word appearing in a document given the presence of another 
word therein is always zero, i.e., a document simply contains an unstructured collection 
of words or simply put a "bag of words". As one can readily appreciate, this 
assumption, with respect to any language, is grossly erroneous. English, like other 
languages, has a rich and complex syntactic and lexicon-semantic structure with words 
whose meanings vary, often widely, based on the specific linguistic context in which 
they are used, with the context determining in any one instance a given meaning of a 
word and what word(s) can subsequently appear. Hence, words that appear in a textual 
passage are simply not independent of each other, rather they are highly inter- 
dependent (Corston Col. 1 line 59 - Col. 2 line 33). 
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based on the structural representation of discourse 

Corston teaches that even if the document does not have a title which contains 
the keyword, sentences in the document can be analyzed to determine the meta 
structure of the document. For example, the subjects of sentences, particularly subjects 
of sentences whose main verb is "be", tend to be the theme or topic of that sentence. 
Precision can be increased, even for keyword queries, by preferentially matching the 
keyword queries against documents containing sentences about that keyword (Corston 
Col.. 33 line 65 - Col.. 34 line 9). 

Corston teaches that a meta structure of the document is indicative of a general 
subject matter of the document, wherein obtaining the set of abstract logical forms 
indicative of a meta structure of the document comprises: obtaining the set of abstract 
logical forms based on formatting information corresponding to the document. 
Additional, obtaining the set of abstract logical forms indicative of a meta structure of the 
document comprises: obtaining the set of abstract logical forms based on topics of 
sentences in the document. Further, Corston teaches that obtaining the set of abstract 
logical forms indicative of a meta structure of the document comprises: obtaining the set 
of abstract logical forms based on subjects of sentences in the document (Corston Col.. 
45 lines 37-55). 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention determining a hybrid text summary based on constituents with 
relevance scores having a threshold score, where perCol.ation is performed based on a 
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relevance score. Percolating or passing relevant scores would allow for efficient 
clustering, where only a specific range of structured data that abides by relevant criteria 
will remain. 

5. Claim 10 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Marcu et al U.S. PGPUB 20020046018 (herein after Marcu) in view of Corston at al. 
US 6901399 B1 (hereinafter Corston) and further in view of Nakao US 6205456 B1 
(hereinafter Nakao). 

Re claim 1 0, Marcu falls to teach the method of claim 1 , wherein percolation of 
relevance scores comprises the steps of: 

Corston teaches percolating relevance score of important discourse constituents 
exceeding an importance threshold value (Corston Col.. 1 line 59 - Col.. 2 line 2). 

Corston teaches that it is well know to use a relevancy score, wherein a user 
input query in such techniques is typically presented as either an explicit user generated 
query, or an implicit query, such as when a user requests documents which are similar 
to a set of existing documents. Typical information retrieval systems search documents 
in the larger data store at either a single word level, or at a term level. Each of the 
documents is assigned a relevancy (or similarity) score, and the information retrieval 
system presents a certain subset of the documents searched to the user, typically that 
subset which has a relevancy score which exceeds a given threshold (Corston Col.. 1 
line 59 -Col.. 2 line 2). 
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Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention determining a hybrid text based on constituents with percolating 
relevance scores having a threshold score, where threshold comparison is performed 
based on a relevance score. Percolating relevance scores relevant to a threshold score 
would allow for efficient clustering, where only a specific range of data that abides by a 
relevance probability will remain. 

However, Marcu in view of Corston fails to teach determining important discourse 

constituents; 

determining unresolved anaphors of the important discourse constituents (Nakao 
Col. 4 line 12-29); 

determining potential resolving discourse constituents containing potential 
antecedent referent of the unresolved anaphors (Nakao Col. 4 line 12-29); 

through the reduced span of potential resolving discourse constituents (Nakao 
Col. 4 line 12-29); 

determining a modified span of the important discourse constituents based on 
the relevance score (Nakao Col. 4 line 12-29). 

Nakao teaches for an anaphoric expression, its antecedent is searched for and 
the anaphoric expression is replaced with the antecedent or a portion containing the 
antecedent is included in a summary so that the summary can be easily understood. 
The antecedent of the anaphoric expression can be identified by a method referred to 
as a centering method. This method makes a list of centers that comprises probable 
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elements (centers) of a sentence to be antecedents of anaphoric expressions in the 
subsequent sentences. The elements probability to be an antecedent is calculated 
mainly by its syntactic role in a sentence, such as subject, direct object, etc. Then, the 
method resolves an anaphoric expression by selecting the most probable element from 
the list with the restriction of agreement of number, gender, etc. 

NOTE: For purposes of prior art, a reduced span is construed to be both 
functionally equivalent and equally effective as a resolved set with the most probable 
elements present. 

Therefore, it would have been obvious to one of ordinary skill in the art at the 
time of the invention determining portions of text containing anaphors, where an 
antecedent referent can be substituted for the anaphor, and removing the antecedent 
referent relevant to a relevancy threshold. Using an anaphor to replace an antecedent 
referent would allow for a preservation of memory, where more than one item can refer 
to a single item (i.e. anaphoric relationship). Additionally, using syntactic analysis with 
antecedent and anaphoric substitutions would allow for a minimized set of data that 
would the most probable with respect to the discourse/text by reduction of a set of 
candidate data. Additionally, percolating scores of various elements relevant to 
probability and antecedent basis would allow for a further reduced set that is minimized 
for both anaphor and original data from the discourse. 



Allowable Subject Matter 

6. Claims 2-9, 1 2-21 , 34, 35, and 37 are allowed. 
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Any inquiry concerning this communication or earlier communications from tine 
examiner should be directed to Michael C. Colucci whose telephone number is (571)- 
270-1847. The examiner can normally be reached on 9:30 am - 6:00 pm, Monday- 
Friday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571 )-272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Michael C Colucci/ 
Examiner, Art Unit 2626 
Patent Examiner 
AU 2626 
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Supervisory Patent Examiner, Art Unit 2626 



